Reconfigurable cloud computing

ABSTRACT

A method, system, and computer-readable storage medium for using a distributed computing system are disclosed. For example, one method involves receiving one or more parameters. The one or more parameters indicate one or more operations. The method also involves selecting one or more computing resources from computing resources. This selecting is based on the parameter(s). An application is configured to be executed using the computing resource(s). The method also involves generating a workflow. The workflow indicates that the application is to be executed using the computing resource(s). The workflow indicates that the application performs the operation(s). The method also involves communicating at least a part of the workflow to one or more nodes, where the node(s) include the computing resource(s).

CROSS-REFERENCE TO RELATED APPLICATIONS

The present patent application is a continuation of U.S. patentapplication Ser. No. 14/282,407, filed on May 20, 2014, entitled“Reconfigurable Cloud Computing,” which is a continuation of U.S. patentapplication Ser. No. 13/449,003, filed on Apr. 17, 2012, entitled“Reconfigurable Cloud Computing,” and issued as U.S. Pat. No. 8,775,576on Jul. 8, 2014, and is incorporated by reference herein in its entiretyand for all purposes as if completely and fully set forth herein.

BACKGROUND OF THE INVENTION

1. Field of the Invention

This application relates to distributed execution. Particularly, thisapplication relates to distributed execution of application using acloud computing environment.

2. Description of the Related Art

High Performance Computing (HPC) systems allow users to accesshigh-performance computer resources to execute various workloads. Inrecent years, such HPC systems have been used instead of stand-alonesupercomputers. A distributed HPC system can include multipleworkstations or servers that are connected using a network, such as ahigh-performance network. In a typical distributed HPC system,computational power, storage power, and functionality of HPC devices canbe made available to a user over a network. As a result, distributed HPCsystems can provide high quality of service to users that can be locatedvirtually anywhere in the world.

HPC systems can be used to execute applications that require processingof large data sets, including genomics, seismic, various analytics, andnuclear physics, among many other scientific and industrialapplications. These are examples of very computationally-intensiveapplications, and many users simply do not themselves have thecomputational resources to quickly process these types of data. However,such distributed HPC systems can be difficult to use, and thus limit thetype of users that can perform these HPC tasks.

SUMMARY OF THE INVENTION

Various systems and methods for using a distributed computing system aredisclosed. For example, one method involves receiving one or moreparameters. The one or more parameters indicate one or more operations.The method also involves selecting one or more computing resources fromcomputing resources. This selecting is based on the parameter(s). Anapplication is configured to be executed using the computingresource(s). The method also involves generating a workflow. Theworkflow indicates that the application is to be executed using thecomputing resource(s). The workflow indicates that the applicationperforms the operation(s). The method also involves communicating atleast a part of the workflow to one or more nodes, where the node(s)include the computing resource(s).

BRIEF DESCRIPTION OF THE DRAWINGS

The embodiments of the present application may be better understood, andits numerous objects, features, and advantages made apparent to thoseskilled in the art by referencing the accompanying drawings.

FIG. 1 is a block diagram illustrating a distributed computing system,according to one embodiment.

FIG. 2 is a flowchart illustrating a method for using a distributedcomputing system, according to one or more embodiments.

FIG. 3A is flowchart illustrating a method for using a distributedcomputing system by a server, according to one or more embodiments.

FIG. 3B illustrates an example of parameters used by the method forusing a distributed computing system, according to one embodiment.

FIG. 4 is flowchart illustrating a method using a distributed computingsystem by a node, according to one or more embodiments.

FIG. 5 is flowchart illustrating elements of a method for using adistributed computing system, according to one or more embodiments.

FIGS. 6A and 6B are block diagrams illustrating a server module,according to one or more embodiments.

FIGS. 7A-7C are block diagrams of nodes, according to some embodiments.

FIG. 8A is a block diagram illustrating operation of a distributedcomputing system, according to one embodiment.

FIGS. 8B-8F illustrate examples of various scripts, according to someembodiments.

FIG. 9A and 9B are flowcharts illustrating portion(s) of a method forusing a distributed computing system, according to one embodiment.

FIG. 10 is a block diagram of a job queue, according to one embodiment.

FIG. 11 is a block diagram illustrating operation of a resource,according to one embodiment.

FIG. 12 is a flowchart illustrating a method for scheduling of a nextjob, according to one embodiment.

FIG. 13 is flowchart illustrating a method for determining a next jobscript, according to one embodiment.

FIG. 14 is a block diagram illustrating various components of a storageelement, according to one embodiment.

FIG. 15 is a block diagram illustrating various components of athird-party storage element, according to one embodiment.

FIG. 16 is a block diagram illustrating various components of a servernode, according to one embodiment.

FIG. 17 is a block diagram illustrating various components of a resourcenode, according to one embodiment.

FIG. 18 is a block diagram illustrating a network architecture in whichan embodiment of the present invention can be implemented.

FIG. 19 is a block diagram that illustrates an example of a computersystem suitable for implementing embodiments of the present invention.

While the embodiments of the application are susceptible to variousmodifications and alternative forms, specific embodiments are providedas examples in the drawings and detailed description. It should beunderstood that the drawings and detailed description are not intendedto limit the embodiments to the particular form disclosed. Instead, theintention is to cover all modifications, equivalents and alternativesfalling within the spirit and scope of the invention as defined by theappended claims.

DETAILED DESCRIPTION

Although the present invention has been described in connection withseveral embodiments, the invention is not intended to be limited to thespecific forms set forth herein. On the contrary, it is intended tocover such alternatives, modifications, and equivalents as can bereasonably included within the scope of the invention as defined by theappended claims.

Distributed computing systems (such as distributed High PerformanceComputing (HPC) systems) allow clients to schedule execution of variousapplications. For example, a user can schedule execution of a datamining application to data mine a database for various keywords and/orconcepts. In another example, a user can schedule a genetics applicationto perform DNA sequencing operations. A distributed computing system,such as a cloud-based system, can allow clients to store user data usingcloud storage, and select an application for execution using this storeddata. This application can be stored, accessed, and/or executed, using aremote server. As a result, the user can perform complex operations ondata without incurring costs for expensive server(s), application(s),and/or data storage. Embodiments of such a distributed computing systemare described below.

FIG. 1 is a block diagram illustrating a distributed computing system100 that includes a collection of clients, server(s), and storage.Distributed computing system 100 includes several clients, server(s),and storage, e.g., client(s) 102(1)-102(N), server(s) 104, storage 108,third-party storage 110, and one or more nodes 112(1)-112(N). Each ofclients, server(s), and storage can communicate with each other usingone or more networks, e.g., network 106A and 106B. Each of network 106Aand 106B can include the Internet, a local area network (LAN), a widearea network (WAN), a storage area network (SAN), and/or any combinationthereof. It is noted that distributed computing system 100 may include adifferent number of elements.

Each client, e.g., client(s) 102(1)-102(N), can be implemented as acomputing entity, including but not limited to, a computer, a phone(e.g. a smart phone), a tablet, a virtual machine, among others. Eachclient can access a server, e.g., server(s) 104), such as by issuing arequest to execute an application. Each client can also access, such asusing network 106A, user data that is stored using third-party storage110. Each client can also store, using network 106A, such user data atthird-party storage 110. Each client can provide one or more parametersto the server. These parameters can include location and/or type ofdata, operation(s) to be performed on this data, and/or type and/or nameof application(s), among others. In one implementation, the client canaccess the network, e.g., the Internet, using an Internet browser toprovide such request. The server can access this data, according to theprovided parameters, perform the specified operation(s), and then returnresults to the client(s). In essence, server 104 provides HPC services(e.g., access to application(s)) to client(s) 102(1)-102(N) overnetwork(s), and that can be referred to as using a cloud, since the userdata, the applications that operate on that data, computing nodes thatexecute such applications, and/or the server(s) that control theseoperations are distributed over one or more networks. These HPC servicescan be easily accessed by client(s) 102(1)-102(N), i.e., by requeststhat simply specify parameters that indicate desired operations(s) ondata.

The server includes a server module (e.g., a server module 114). Servermodule 114 can receive a request from the client(s) 102(1)-102(N) vianetwork 106A. Server module 114 can select an application based on thisrequest. For example, the request can include parameters that indicateoperation(s) to be performed on data, and thus server module 114 canselect an application that can perform these operation(s). Server module114 can select computing resources for this application. Server module114 can communicate over network 106B with nodes 112(1)-112(N) to sendcommunication (e.g., based on a workflow) to execute the applicationusing the selected computing resources. Server module 114 can receivethe execution results of the application from the node(s). Server module114 can then return these results to the client that initiated therequest. Server module 114 can access various models and data in storage108 during operation. Examples of server 104 are provided in FIG. 16,and more generally, in FIGS. 18 and 19. Examples of server module 114are provided in FIGS. 6A and 6B, among others.

Each node 112(1)-112(2) can include one or more computing resources. Anode, e.g., node 112(1), can receive communication, over network 106B,(e.g., based on a workflow) from server module 114 to execute anapplication using one or more computing resources. The application canaccess the data from third-party storage 110 during such execution, asspecified by the parameters. The node(s) can return results of theapplication execution to server module 114. Examples of nodes112(1)-112(N) are provided in FIGS. 7A-7C, and FIG. 17, and moregenerally, in FIGS. 18 and 19.

Each client, e.g., 102(1)-102(N), can access, such as over network 106B,third-party storage, e.g., 110. Third-party storage 110 can include oneor more distributed storage devices and/or external cloud storage, amongothers. Third-party storage 110 can store data, such as data that isstored by the client(s). This stored data that can be operated on by theapplication. In one implementation, third-party storage can beimplemented by a cloud storage element, allowing client(s) to upload andstore their data separately from the client(s).

The network, e.g., network 106A and/or 106B, can include the Internetand/or other network(s), such as LAN, WAN, and/or SAN. The network isconfigured to allow communication between the client(s), server(s),and/or storage. In one implementation, the client(s) can access otherelements of the distributed computing system using a first type ofnetwork (e.g., WAN), whereas the server(s) can access other elements ofthe distributed computing system using a second type of a network (e.g.,LAN).

FIG. 2 is a flowchart illustrating a method 200 for using a distributedcomputing system, according to one or more embodiments. As will beappreciated in light of the present disclosure, this method may bemodified in order to derive alternative embodiments. Also, theoperations in this embodiment are shown in sequential order. However,certain operations may occur in a different order than shown, certainoperations may be performed concurrently, certain operations may becombined with other operations, and certain operations may be absent inanother embodiment. Method 200 is described with reference to variationsof the elements described in connection with FIG. 1. In one embodiment,method 200 is implemented by server module 114.

In element 202, a request to perform one or more operations on data isreceived, according to one embodiment. For example, server module 114can receive, using network 106A, this request from client 102(1). Therequest can include various parameters that indicate, for example, theoperations to be performed and the data on which the operations are tooperate, among others.

In element 204, application and resource(s) are selected based on theparameters, according to one embodiment. For example, server module 114can select an application and one or more computing resources, based onthe parameters. The computing resources can be located on one or morenodes (e.g., nodes 112(1)-112(N)). The application can be executed usingthese computing resources, and the application can be configured toexecute to perform the specified operations.

In element 206, data is accessed, according to one embodiment. Forexample, node 112(1) can access stored data that is stored usingthird-party storage 110 (such as using cloud storage). In oneembodiment, the selected computing resource(s) can perform this dataaccess, such as prior to executing the application. It is noted that inone embodiment, the data access of element 206 is performed as a part ofelement 208.

In element 208, the application is executed using the computingresources, according to one embodiment. The application is executedbased on the parameters of the request, according to one embodiment. Inone implementation, the selected computing resource(s) can access thedata (e.g., from third-party storage) during the execution of theapplication.

In element 210, the results of the application execution can be returned(such as over network 106A) to the client(s) that initiated the request.Method 200 is described in more detail below, such as with respect toFIGS. 3 and 4. In some embodiments, various elements of method 200 canbe executed by server module 114, and other elements of method 200 canbe executed by one or more nodes 112(1)-112(N).

FIG. 3 is a flowchart illustrating a method 300 for using a distributedcomputing system, according to one or more embodiments. As will beappreciated in light of the present disclosure, this method may bemodified in order to derive alternative embodiments. Also, theoperations in this embodiment are shown in sequential order. However,certain operations may occur in a different order than shown, certainoperations may be performed concurrently, certain operations may becombined with other operations, and certain operations may be absent inanother embodiment. Method 300 is described with reference to variationsof the elements described in connection with FIG. 1. In one embodiment,method 300 is implemented by server module 114.

In element 302, parameters for execution of an application are received,according to one embodiment. For example, server module 114 can receive,using network 106A, the parameters from client 102(1). A request fromone of the clients can include these parameters. In one embodiment, theparameters can include one or more of the following parameters, as alsoshown in FIG. 3B (which illustrates various parameters 350 that can beprovided using the request):

Application(s)—desired application(s) 356 to be executed, theapplication parameters can include location of executable binaries (suchas on storage);

Operation(s)—desired operation(s) (e.g., functions) of the applicationsto be executed, operation(s) may be listed for each of theapplication(s) specified above, each of the specified operation(s) canalso include specified execution parameters, operation(s) can specify(implicitly or explicitly) any data dependency between any multipleoperations;

Resource(s)—desired resource(s) for execution of each of theapplication(s);

Customer(s)—customer(s) 358 making the request, may include associateduser name, password, email address, and other contact information;

Data file(s)—location, size, and/or data type parameter(s) for the datafile(s) 352 that is to be operated on by the application (e.g., as inputdata), location information can include names and/or path names of datafiles on the third-party storage.

In element 304, computing resource(s) are selected based on theparameters, according to one embodiment. For example, server module 114can select one or more computing resources based on the parameters. Thecomputing resources can be located on one or more nodes (e.g., nodes112(1)-112(N)). Server module 114 can also select an application that isto be executed using the selected computing resource(s).

In element 306, a workflow is generated based on the parameters and theselected computing resources, according to one embodiment. For example,server module 114 can generate such a workflow. The workflow can includevarious components (e.g., scripts), that can indicate execution details,such as which computing resources to use, which application to execute,specific parameters for the computing resource(s) and/or theapplication, among others.

In element 308, communication is sent to the one or more nodes based onthe workflow, according to one embodiment. This communication isconfigured to indicate to the node(s) to execute the application. Forexample, server module 114 can communicate (e.g., over network 106B)instructions to the nodes to initiate execution of the application. Inone embodiment, server module 114 can communicate at least a portion ofthe workflow to one or more of nodes 112(1)-112(N), such as to thenode(s) that include the selected computing resources (i.e., selected inelement 204). In one embodiment, server module 114 can also queue thisworkflow, or elements of the workflow, prior to performing thecommunication of element 308. Server module 114 can then send thecommunication to the node(s) in accordance with a scheduling method. Inone embodiment, the communication that is sent to the node(s) includesjob script(s) and/or configuration script(s). Server module 114 can usethe control script(s) to control the scheduling and/or operation of thejob scripts, and/or configuration of the selected computing resources.

In element 310, the results of execution of the application arereceived, such as by server module 114, according to one embodiment.These results can be sent from one or more nodes 112(1)-112(N), such asfrom the node that includes the selected computing resource where theapplication was executed. In element 212, the results can be returned tothe client(s), such as over network 106A.

FIG. 4 is a flowchart illustrating a method 400 for operating adistributed computing system, according to one or more embodiments. Aswill be appreciated in light of the present disclosure, this method maybe modified in order to derive alternative embodiments. Also, theoperations in this embodiment are shown in sequential order. However,certain operations may occur in a different order than shown, certainoperations may be performed concurrently, certain operations may becombined with other operations, and certain operations may be absent inanother embodiment. Method 400 is described with reference to variationsof the elements described in connection with FIG. 1. In one embodiment,method 400 is implemented by one or more of node(s) 112(1)-112(N).

In element 402, communication from server is received, according to oneembodiment. For example, node 112(1) can receive communication, based onthe workflow, from server module 114. This communication can indicatehow and where to execute an application, as described below. In oneembodiment, this communication can include parts of the workflow, suchas job script(s) and/or configuration script(s). For example, asindicated by a workflow (e.g., control script(s)), a scheduler module(on the server) can instruct (using job script(s)) node(s) to executeselected operation(s) of application(s) using data accessed from thethird-party storage. Examples of script(s) are provided in FIGS. 8B-8F,among others.

In element 404, computing resource(s) are accessed as indicated in theworkflow, according to one embodiment. For example, node 112(1) canaccess one or more computing resources. Node 112(1) can also configureone or more of these resource(s), as indicated in the workflow. In oneembodiment, node 112(1) can perform some scheduling functionality. Forexample, node 112(1) can queue at least portions of the receivedworkflow, and execute these queued workflow portion(s) when the selectedcomputing resources become available.

In element 406, data is accessed, according to one embodiment. Forexample, node 112(1) can access stored data that is stored usingthird-party storage 110 (such as using cloud storage). In oneembodiment, the selected computing resource(s) can perform this dataaccess, such as prior to executing the application. It is noted that inone embodiment, the data access of element 406 is performed duringelement 408.

In element 408, the application is executed, based on the workflow,using parameters and the data, according to one embodiment. For example,the selected computing resources can execute the application. In oneembodiment, the selected computing resource(s) can access the data(e.g., from third part storage) during the execution of the application.The computing resources can be located on one or more nodes (e.g., nodes112(1)-112(N)).

In element 410, the results of execution of the application arecommunicated to the server, according to one embodiment. For example,node(s) 112(1)-112(N) can communicate the execution results to servermodule 114. These results can be sent by the node that includes theselected computing resource where the application was executed.

FIG. 5 is a flowchart illustrating a method 500 for selecting computingresources, according to one or more embodiments. As will be appreciatedin light of the present disclosure, this method may be modified in orderto derive alternative embodiments. Also, the operations in thisembodiment are shown in sequential order. However, certain operationsmay occur in a different order than shown, certain operations may beperformed concurrently, certain operations may be combined with otheroperations, and certain operations may be absent in another embodiment.Method 500 is described with reference to variations of the elementsdescribed in connection with FIG. 1. In one embodiment, method 500 isimplemented by server module 114. In one embodiment, method 500implements at least portions of elements 304 and/or 306 of method 300.

In element 502, parameters are analyzed, according to one embodiment.For example, server module 114 can analyze the parameters of thereceived request. During this analysis, server module 114 can determinethat the parameters indicate location and/or type of data (that is to beaccessed during execution of a certain application), operation(s) to beperformed on this data, and/or type and/or name of application(s), amongothers. For example, the parameters can indicate that certain genesequencing operations are to be performed on data that is located oncloud storage.

In element 504, an application is determined, according to oneembodiment. For example, server module 114 can determine whatapplication is to be executed. Server module 114 can make thisdetermination based on the analysis of the parameters (e.g., on element502). For example, server module 114 can determine that aBurrows-Wheeler Aligner (BWA) application should be used to perform thegene sequencing operation specified by the received request.Furthermore, server module 114 can also determine that two separateapplications should be executed. The first application can be a BWAapplication, that can perform a BWA align operation. The secondapplication can be Sequence Alignment/Map (SAM) application, forexample, which can perform various operations on the output of the BWAalign operation. It is noted that server module 114 can determine (basedon the parameters of the request and/or a model, as described below)that multiple operations of a single application should be performed,such as multiple operations of the SAM application.

In element 506, available computing resources can be determined,according to one embodiment. For example, server module 114 candetermine which of the multiple computing resources of node(s)112(1)-112(N) can be used to execute the application. For example, eachnode can include a different set of computing resources, e.g., node112(1) can include Field Programmable Gate Arrays (FPGA) (and/or anyother type of reconfigurable logic devices, including erasableprogrammable logic devices (EPLDs), complex programmable logic devices(CPLDs), and/or programmable array logic (PALs), commonly referred toherein as FPGAs) computing resources, node 112(2) can include graphicalprocessing unit (GPU), central processing unit (CPU), digital signalprocessing (DSP) resources, or a combination, such as a computingresource having both DSP and FPGA elements. Server module 114 candetermine that the selected application is to be executed using acertain resource of node 112(1). In another example, server module 114can determine that the BWA application (i.e., implementing the alignoperation) is to be executed on one resource of node 112(1), and that acertain operation of the SAM application is to be executed on anotherresource, such as of another node 112(2).

In one embodiment, attributes of the nodes can be accessed when makingthis determination. For example, these attributes can characterizecomputing resources of each node. These attributes can be compared, andthen matched, to any resource requirements that are implicitly and/orexplicitly included in the parameters. In one implementation, eachapplication can be characterized by a model, and each such model caninclude resource requirements for that application and/or operations ofthat application. The resource requirements can also specify anyresources needed to perform the operation(s) of the request.

In element 508, a determination is made as to whether to configure thecomputing resource(s), according to one embodiment. For example, servermodule 114 can determine whether any of the selected resources should beconfigured prior to execution. Element 510 may be executed for eachcomputing resource that is selected (and that can be configured). Thenode hosting each such computing resource can, e.g., during scheduling,determine whether that respective computing resource should beconfigured prior to executing a certain application (as described belowwith reference to FIG. 12). In other words, server module 114 candetermine whether to perform a configuration of the computing resources(e.g., hardware FPGA and/or GPU resources, among others) based onrequest from the client.

However, in one embodiment, this request does not include anyreconfiguration requests, and instead server module 114 makes thisdetermination. For example, server module 114 can determine that FPGA,GPU, and/or CPU computing resource(s) can execute faster once configuredfor the determined application. Server module 114 can make thisdetermination based on the model and resource attributes of eachcomputing resource. For example, the model can indicate (i.e., inresource requirement(s)) that a certain bit stream of an FPGA is needed.Server module 114 can compare this resource requirement to the resourceattributes of the selected computing resource (i.e., the FPGA). Servermodule 114 can then determine that this FPGA, while can support theresource requirement, should be reconfigured prior to executing thedetermined application. Furthermore, server module 114 can determinethat multiple FPGAs (e.g., on a single resource node) should bereconfigured to support a parallel execution mode (as specified byresource requirements). Similarly, server module 114 can determine thatthe selected GPU/CPU computing resources should be reconfigured (e.g.,to place the GPU/CPU in a different operating mode).

In element 510, a configuration of at least one of the computingresources is determined, according to one embodiment. For example,server module 114 can determine a configuration for the selectedcomputing resource(s). In another embodiment, server module 114 candetermine a respective configuration for each computing resource that isconfigurable, not just for the selected computing resource(s). For bothof these embodiments, the configurations can be used when generating theworkflow. This determination of element 510 is used to generate theworkflow, such as in element 306.

In element 512, the computing resource(s) are indicated for selection,according to one embodiment. For example, server module 114 can indicatethat the computing resources (i.e., as determined in element 506) are tobe selected. Server module 114 can also indicate whether (and how) oneor more of the selected computing resources (i.e., as determined inelements 508 and/or 510) are to be configured. It is noted that inelement 512, server module 114 can indicate that the application (asdetermined in element 504) is to be executed on the selected computingresources. These selection indications are then used during the workflowgeneration (i.e., element 306).

FIG. 6A is a block diagram 600 of a server module 602, according to oneor more embodiments. Server module 602 can be an implementation ofserver module 114. In one embodiment, server module 602 includes afront-end portal 604 and a back-end system interface 604. Front-endportal 604 can communicate with clients (e.g., clients 112(1)-112(N)),with the storage (e.g., storage 108) and/or third-party storage (e.g.,third-party storage 110), and with back-end system interface 606. In oneembodiment, back-end system interface 606 can communicate with front-endportal, with the storage (e.g., storage 108), and with the nodes (e.g.,nodes 112(1)-112(N). It is noted that server module 602 may include adifferent number of elements.

FIG. 6B is a block diagram 650 of a server module 652, according to oneor more embodiments. Server module 652 can be an implementation ofserver module 114. In one embodiment, server module 652 includes aresource selector 654, a model selector 656, a workflow generator 658, aparameter validator 660, a data validator 662, and scheduler 664, amongothers. It is noted that server module 652 may include a differentnumber of elements. For example, various elements of server module 652can be combined, as desired. Furthermore, one or more of elements ofserver module 652 can be implemented by one or more nodes (e.g., nodes112(1)-112(N). For example, at least some of functionality of scheduler664 can be implemented by one or more nodes. The operation of variouselements of server module 652 is described below with reference to FIGS.8 and 16, among others. Furthermore, one or more of these elements maybe implemented as software and/or hardware modules.

FIGS. 7A-7C are block diagrams 700, 720, and 740, respectively, ofnodes, according to some embodiments. Each such node can include one ormore computing resources and some attributes (e.g., resource attributes)that characterize these computing resources. The computing resources canbe used to execute an application. Depending on the type of computingresource, at least some of such application can reside on the nodeand/or on the computing resource itself. For example, for FPGAs, somefunctions of such application can be implemented by each FPGA. Inanother example, for GPUs, some functions of such application can beexecuted by each GPU. However, other implementations are contemplated.

FIG. 7A illustrates a node 702 that includes FPGA computing resources704(1)-704(N) and one or more attributes 706. In one embodiment, eachFPGA computing resource 704(1)-704(N) can execute an application, asindicated in a workflow. In another embodiment, each FPGA computingresource 704(1)-704(N) can execute a portion of an application, whereasnode 702 can execute (e.g., by using separate processor(s), not shown inthis figure) any remaining functionality of this application.Furthermore, node 702 includes a configurator 708 and a resource manager710, operations of which are described below, e.g., with reference toFIG. 8. It is noted that although only FIG. 7A shows a configurator anda resource manager, in one embodiment, each node can include both ofthese elements. However, in another embodiment, configurator 708 andresource manager 710 of one of the nodes can be used by the remainingnodes.

For example, each FPGA computing resource 704(1)-704(N) can execute oneor more functions (such as algorithm-intensive functions and/orfunctions can be performed in parallel). Each FPGA computing resource704(1)-704(N) can thus execute a separate function (and/or a differentinstance of some function) in parallel, thus increasing an overallperformance of an application. In one embodiment, node 702 can, uponreceiving at least a part of the workflow (i.e., from the servermodule), execute the application. For example, a scheduler on the servermodule can send job scripts to the node, where each job script indicatesan application that is to be executed on a certain FPGA computingresource.

Node 702 and/or individual FPGA computing resource(s) 704(1)-704(N) canaccess data (e.g., from a third-party storage) when executing theapplication. The application that is being executed by node 702 and/orindividual FPGA computing resource(s) 704(1)-704(N) can be stored onnode 702, on the server module, and/or storage 108. Furthermore,attribute(s) 706 can include various attributes for FPGA computingresource(s) 704(1)-704(N) and/or node 702. Attribute(s) 706 can includemetadata about the capabilities of FPGA computing resource(s)704(1)-704(N) and/or node 702. FPGA computing resource(s) 704(1)-704(N)can be configured, such as indicated in the workflow (e.g., byconfiguration script(s)).

FIG. 7B illustrates a node 720 that includes processor computingresources 724(1)-724(N) and one or more attributes 726. In oneembodiment, each processor computing resource 724(1)-724(N) can executean application, as indicated in a workflow. In another embodiment, eachprocessor computing resource 724(1)-724(N) can execute a portion of anapplication, whereas node 722 can execute (e.g., by using separateprocessor(s), not shown in this figure) any remaining functionality ofthis application. Each processor computing resource can be a GPU, and/oranother dedicated processor. For example, each processor computingresource 724(1)-724(N) can execute one or more functions (such ascompute-intensive functions/algorithms and/or functions/algorithms thatcan be performed in parallel). Each processor computing resource724(1)-724(N) can thus execute a separate function (and/or a differentinstance of some function) in parallel, thus increasing an overallperformance of an application. In one embodiment, node 722 can, uponreceiving at least a part of the workflow (i.e., from the servermodule), execute the application. For example, a scheduler on the servermodule can send job scripts to the node(s), where each job scriptindicates an application that is to be executed on a certain processorcomputing resource.

Node 722 and/or individual processor computing resource(s) 724(1)-724(N)can access data (e.g., from a third-party storage) when executing theapplication. The application that is being executed by node 722 and/orindividual FPGA computing resource(s) 724(1)-724(N) can be stored onnode 722, on the server module, and/or storage 108. Furthermore,attribute(s) 726 can include various attributes for processor computingresource(s) 724(1)-724(N) and/or node 722. Attribute(s) 726 can includemetadata about the capabilities of processor computing resource(s)724(1)-724(N) and/or node 722. Processor computing resource(s)724(1)-724(N) can be configured, such as indicated in the workflow(e.g., by configuration script(s)).

FIG. 7C illustrates a node 740 that includes another type of computingresources 744(1)-744(N) and one or more attributes 746. Similar to thecomputing resources of FIGS. 7A and 7B, each computing resource744(1)-744(N) can execute an application or a portion of an application,as indicated in a workflow. Computing resource(s) 744(1)-744(N) can bealso configured, such as indicated in the workflow (e.g., byconfiguration script(s)).

FIG. 8A is a block diagram 800 illustrating operation of a distributedcomputing system, according to one embodiment. FIGS. 8B-8F illustratevarious scripts that can be used and/or generated by the elements ofFIG. 8A, according to some embodiments.

One or more inputs 802(1)-802(N) can be received by a model selector 804and/or a parameter validator 808. Both model selector 804 and parametervalidator 808 can be included in a server module. Input(s) 802(1)-802(N)can be first received by an input/output module (not shown). Inputs802(1)-802(N) include parameters that are received from a client, suchas received in a request (e.g., see element 302). Model selector 804 canaccess model data 806 to select one or more models 810 for anapplication. Model selector 804 selects model(s) 810 based on input(s)802(1)-802(N).

Each model can be associated with one or more applications. The model isassociated with a certain application, where this application canperform the operation(s) described by input(s) 802(1)-802(N). In oneembodiment, input(s) 802(1)-802(N) can identify an application. In thisexample, model selector 804 can check whether this identifiedapplication includes the operation(s) that are to be performed on data,as specified by input(s) 802(1)-802(N). For example, each model caninclude templates on how to generate various scripts, described below,for the application. Each model can also indicate one or more computingresource(s) where the application can be executed. Thus, in oneembodiment, model selector 804 can perform element 504 of method 500,i.e., by selecting a model that corresponds to the applicationidentified by the parameters (of the request).

Parameter validator 808 can validate input(s) 802(1)-802(N) prior tomodel selector 804 selecting model(s) 810. In one embodiment, modelselector 804 also uses data from data storage 832 when selectingmodel(s) 810. The data from data storage 832 can be first verified by adata validator 834 prior to model selector 804 accessing this data(i.e., as validated data 836). For example, model 810 can include a dataformat. Data validator 834 can use data format of model 810 to verifythat the data complies with the data format for the selected model,e.g., as described with reference to FIG. 9A.

Both resource selector 812 and workflow generator 814 receive model(s)810 and/or inputs 802(1)-802(N). Resource selector 812 determinescomputing resources to be selected based on parameter(s) (of therequest). In one embodiment, resource selector 812 can communicate withone or more nodes (e.g., nodes 112(1)-112(N)) in order to determine whatcomputing resources are available on each node. Resource selector 812can access model(s) 810 to determine what computing resource(s) areindicated as being suitable for application execution. Resource selector812 can also determine if any of the determined computing resources canbe configured prior to application execution, and if so, resourceselector 812 can indicate how to configure each such computing resource.In one embodiment, resource selector operates to perform elements506-512 of method 500. Furthermore, in one embodiment, model selector804 and resource selector 812 are configured to perform element 304 ofmethod 300.

Workflow generator 814 generates a workflow 816 that can include one ormore control scripts 818, one or more job scripts 820(1)-820(N), and/orone or more configuration script(s) 822(1)-822(N). Workflow generator814 can generate workflow 816 based on input(s) 802(1)-802(N), model(s)810, as well as any indications and/or determinations made by resourceselector 812, and optionally based on the data, such as the validateddata. Control script(s) 818 are configured to control execution of theother elements of workflow 816. Job script(s) 820(1)-820(N) indicate howto execute the application(s) selected by model selector 804.Configuration script(s) 822(1)-822(N) indicate how to configurecomputing resource(s) that are determined by resource selector 812. Inone embodiment, workflow generator is configured to perform element 306of method 300.

Workflow 816 is communicated to a scheduler 824, a queue 826, a resourcemanager 828, and/or a configurator 830. In one embodiment, each ofscheduler 824, resource manager 828, and configurator 830 can receive adifferent subset of workflow 816. In one embodiment, scheduler 824 canreceive control script(s) 818 and job script(s) 820(1)-820(N), whileconfigurator 830 receives configuration script(s) 822(N).

Scheduler 824 can use control script(s) 818 to control operation of theother elements of workflow 816. For example, control script(s) 818 canindicate the order in which job scripts 820(1)-820(N) are to beaccessed. Scheduler 824 can access queue 826 (such as described belowwith reference to FIG. 10A) as well as a one or more job schedulingalgorithms, one example of which is illustrated in FIG. 11. In oneembodiment, scheduler 824 can be implemented by the server module. Inanother embodiment, scheduler 824 can be implemented in part by theserver module, and in part by one or more nodes.

For example, job script 820(1) can indicate that a BWA application is tobe executed using certain data from data storage 832. Job script 820(2)can indicate that a SAM align application is to be executed usingresults from the execution of this BWA application. Therefore controlscript(s) 818 can indicate that job script 820(1) is accessed (e.g.,executed) prior to job script 820(2) being accessed. Furthermore,control script(s) 818 can indicate that the BWA application (asindicated by job script 820(1)) is executed on a first computingresource (e.g., an FPGA), while the SAM application (as indicated by jobscript 820(2)) is executed on a second resource (e.g., a GPU processor).Control script(s) 818 can also indicate that prior to execution of theBWA application (as indicated by job script 820(1)), the FPGA computingresource (where this execution would take place) is re-configured firstbased on configuration script 822(1). Similarly, control script(s) 818can also indicate that prior to execution of SAM application (asindicated by job script 820(2)), the GPU computing resource (where thisexecution would take place) is re-configured first based onconfiguration script 822(2).

Resource manager 828 can manage the execution of applications on variouscomputing resources. Resource manager 828 can also indicate to scheduler824 which computing resources are busy executing other applications.Configurator 830 can configure various computing resources usingconfiguration script(s), e.g., as indicated by scheduler 824. Forexample, scheduler 824 can indicate to configurator 830 that a certaincomputing resource is to be configured using configuration script822(1). Upon receiving such indications from scheduler 824, configurator830 can reconfigure this computing resource using configuration script822(1). For example, configurator 830 can configure an FPGA resourceusing a configuration script of FIG. 8E.

Below is a sample model, such as model 810 of FIG. 8A, that can be usedto dynamically generate various scripts, such as control script(s) 818,job script(s) 820(1)-820(N), and/or configuration script(s)822(1)-822(N).

# A model to dynamically generate job scripts # number of cpus to usefor sampe step − selected via pe_ncpus=8 ref=$1 seq1=$2 seq2=“”aln_jobname=${alnn1}${alnn2}bwa_aln alnscript=${aln_jobname}_$$.sh echo“#!/bin/sh” > $alnscript echo “#$ PBS -mabe” >> $alnscript echo “#$ PBS-Nbwa-aln” >> $alnscript echo “ ” >> $alnscript echo “echo\”/opt/convey/cnybwa/bin/cnybwa aln $alnopts $ref $seq1 -f $sai1\“ ” >>$alnscript echo “ ” >> $alnscript echo “echo \”--------------\“ ” >>$alnscript if [ -n “$seq2” ] ; then  echo “ ” >> $alnscript  echo “echo\”/opt/convey/cnybwa/bin/cnybwa aln $alnopts $ref $seq2 -f $sai2\“ ” >>$alnscript  echo “ ” >> $alnscript  echo “ ” >> $alnscript  echo “echo\”--------------\“ ” >> $alnscript fi echo “ ” >> $alnscript if [ -n“$seq2” ] ; then  sampe_jobname=${alnn1}${alnn2}bwa_sampe else sampe_jobname=${alnn1}${alnn2}bwa_samse fisampescript=${sampe_jobname}_$$. sh echo “#!/bin/sh” > $sampescript echo“#$ PBS -mabe” >> $sampescript echo “#$ PBS -Nbwa-sampe” >> $sampescriptecho “#$ PBS -1 gpus=1” >> $sampescript echo “ ” >> $sampescript echo“ncpus=$pe_ncpus” >> $sampescript echo “ ” >> $sampescript echo “echo\”--------------\“ ” >> $sampescript echo “ ” >> $sampescript if [ -n“$seq2” ] ; then  echo “echo \“$bwa_path/bwa sampe -t \$ncpus $peopts$ref $sai1 $sai2 $seq1 $seq2 $samfile\” ” >> $sampescript  echo “ ” >>$sampescript else  echo “echo \”$bwa_path/bwa samse -t \$ncpus $peopts$ref $sai1 $seq1 $samfile\ “ ” >> $sampescript  echo “ ” >> $sampescript echo “time $bwa_path/bwa samse -t \$ncpus $peopts $ref $sai1 $seq1$samfile” >> $sampescript  echo “ ” >> $sampescript fi echo “echo\”--------------\“ ” >> $sampescript echo “ ” >> $sampescript

Using the model shown above and parameters from a request, workflowgenerator 814 dynamically generates various scripts. For example, onerequest parameter can indicate that a sampe job script (for a sampeoperation) is to be generated. Also, the generated job script canindicate that a different version of a sampe operation (i.e., bwa_sampeor bwa_samse) is to be used, depending on value of another parameter(i.e., that indicates a type of the sampe operation). Similarly, themodel above can be used to select a type and/or number of resources forexecution of the sample operation. In the example above, type and numberof CPU resources can be selected based on the parameters, such as an“ncpu” parameter, the “-n” parameter. In other implementations, the typeof resource(s), on which the applications are to execute, can also beselected, such as FPGA(s), CPU(s), and/or GPU(s), among others.

FIGS. 8B-8F illustrate examples of scripts that are used by elements ofFIG. 8A, according to some embodiments. FIG. 8B illustrates an exampleof a control script 840, such as control script 818. Control script 840can be generated for a request for a two-step BWA operation. Controlscript 840 can be generated by workflow generator 814 based on theparameters and/or a model (e.g., model 810). Control script 840 canindicate, e.g., to scheduler 824, that a jobscript for a first BWA step842 is to be executed first. Control script 840 can also indicate that ajobscript for a second BWA step 844 is to be executed next, uponsuccessful execution of the first job script (i.e., by using the“afterok” element which is a conditional element, i.e., execute thesecond job script after the execution of the first job script issuccessful). Although not shown in FIG. 8B, control script 840 can alsoindicate that configuration script(s) are to be executed prior toexecution of the first and/or second job scripts. Control script 840 canalso indicate that any such configuration scripts are to be executedonly as necessary.

FIGS. 8C and 8D illustrate example job scripts, according to someembodiments, according to one embodiment. FIG. 8C illustrates a jobscript 850 for a BWA align operation, according to one embodiment. Jobscript 850 includes references 852 to external data (such as data indata storage 832). Job script 850 can indicate location 854 of anapplication as well as of any other supporting files for thatapplication. Job script 850 can also indicate what resource(s) to use.However, in some embodiments, the resource(s) to be used the applicationcan be selected by scheduler 824 instead, such as based on theassociated control script. Job script 850 can also indicate one or moreoutputs 856 (i.e., output files) that are created by execution of theapplication.

FIG. 8D illustrates a job script 860 for a BWA sampe operation,according to one embodiment. Job script can indicate what resource is tobe used when executing a corresponding application (e.g., as indicatedby elements 862). However, in some embodiments, the resource(s) to beused by the application can be selected by scheduler 824 instead. Jobscript 860 includes references 864 to external data (such as data indata storage 832). References 864 can include files generated by anotherjob script, i.e., files 856 indicated by job script 850. Job script 860can indicate location 866 of an application as well as of any othersupporting files for that application. Job script 860 can also indicateone or more outputs 868 (i.e., output files) that are created byexecution of the application.

FIG. 8E illustrates a configuration script 870, according to oneembodiment. Configuration script 870 can be an implementation ofconfiguration script 822(1). Configuration script 870 can includeinstructions to reconfigure a resource. In this case, configurationscript 870 includes instructions to change a bit stream value of an FPGAresource (e.g., FPGA 704(1)). Configuration script 870 can be executedbefore execution of a job script (i.e., that indicates that anapplication is executed on such as FPGA resource). In one embodiment,configuration script 870 can reference one or more hardware descriptionlanguage (HDL) files (e.g., such as VHDL and/or Verilog) that can beused to configure an FPGA. In another embodiment, configuration script870 can reference a configuration file that can reconfigure a GPUresource (e.g., by reconfiguring register(s) and/or operation mode(s) ofthis GPU resource).

FIG. 8F illustrates another example of a control script 880, such ascontrol script 818. Control script 880 can be generated for a requestfor a one-step SWS operation. Control script 880 can be generated byworkflow generator 880 based on the parameters and/or a model (e.g.,model 810). Control script 880 can indicate, e.g., to scheduler 824,that a jobscript for SWS operation 882 is to be executed. Control script880 can also indicate that one or more configuration scripts can beexecuted.

FIG. 9A is a flowchart illustrating a method 900 for validatingparameters, according to one or more embodiments. As will be appreciatedin light of the present disclosure, this method may be modified in orderto derive alternative embodiments. Also, the operations in thisembodiment are shown in sequential order. However, certain operationsmay occur in a different order than shown, certain operations may beperformed concurrently, certain operations may be combined with otheroperations, and certain operations may be absent in another embodiment.Method 900 is described with reference to variations of the elementsdescribed in connection with FIGS. 1 and 8. In one embodiment, method900 is implemented by parameter validator 808. In one embodiment, method900 can be performed as a part of, or prior to, element 304 of method300.

In element 902, a determination is made as to whether parameters shouldbe validated, according to one embodiment. For example, parametervalidator 808 can make this determination. If the determinationindicates that parameters should not be validated, then method 900executes element 910 next. If the determination indicates that theparameters should be validated, then method 900 executes element 904next.

In element 904, parameters are validated, according to one embodiment.For example, parameter validator 808 can perform this validation. In oneimplementation, parameter validator 808 can determine whether theparameters (i.e., of the request) are in proper format (e.g., useable bymodel selector 804), contain a proper number of elements. In otherwords, this validation ensures that the model can be used to generatethe various scripts, and/or that the parameters are sufficient for theselected application for proper execution.

In element 906, a determination is made as to whether the validation wassuccessful, according to one embodiment. For example, parametervalidator 808 can perform this determination. If the determinationindicates that the validation was successful, then method 900 executeselement 910 next. If the determination indicates that the validation wasnot successful, then method 900 executes element 908 next. In element908, an error condition may be raised, such as by server module 114.Upon such an error condition, further processing of the request is notperformed, and instead server module 114 can send an indication of thiserror to the client that initiated this request.

In element 910, computing resources are selected using the validatedparameter(s). For example, server module 114 can perform this element.In one embodiment, element 910 is a variation of element 304, i.e., thecomputing resources are selected using parameters that have been eithervalidated by element 906, or selected using non-validated parameters ifthe parameters do not need to be validated.

FIG. 9B is a flowchart illustrating a method 950 for generating scriptsfrom a model, according to one or more embodiments. As will beappreciated in light of the present disclosure, this method may bemodified in order to derive alternative embodiments. Also, theoperations in this embodiment are shown in sequential order. However,certain operations may occur in a different order than shown, certainoperations may be performed concurrently, certain operations may becombined with other operations, and certain operations may be absent inanother embodiment. Method 950 is described with reference to variationsof the elements described in connection with FIGS. 1 and 8. In oneembodiment, method 950 is implemented by workflow generator 814. In oneembodiment, method 950 can be performed as a part of element 306 ofmethod 300.

In element 952, a model is selected. For example, model selector 804 canselect a model for an application based on the parameters. Each modelcan describe the operations, requirements, etc., of each applicationthat can be executed using the computing resources. It is noted that theactual application is implemented by the nodes, including at least partsof such applications by the computing resources.

In element 954, the workflow is generated for the selected model. Forexample, workflow generator 814 generates workflow 816. Such workflowcan include control script(s), job script(s), and configurationscript(s). Workflow generator 814 can generate such a workflow based oninput(s) as well as any indications and determinations made by aresource selector, in addition to the selected model. Examples of modeland script generation using selected model(s) are described above withreference to FIGS. 8A-8F.

FIG. 10A is a block diagram 1000 illustrating a queue 1002, according toone embodiment. In one embodiment, queue 1002 can be an implementationof queue 826. Queue 1002 can be implemented as a part of, or externallyto, the scheduler. Queue 1002 stores various elements of a workflow,such as job scripts. The order of access to these job scripts can becontrolled by scheduler 824. In one embodiment, a job script is executedwhen it is accessed by the scheduler. In another embodiment, a jobscript may contain instructions (i.e., that can be used by thescheduler) on how to execute an application.

Queue 1002 illustrates multiple job scripts 1004(1)-1004(N). These jobscripts 1004(1)-1004(N) can correspond to workflows for variousrequests. For example, both job script 1004(1) and 1004(2) cancorrespond to a single request to perform gene sequence analysisoperations on a certain data set. Job script 1004(1) is thus generated(e.g., by workflow generator 814) to perform a BWA align operation onthis data. Job script 1004(2) is also generated to perform a SAMfunction on the result of the BWA align operation. An associated controlscript, such as control script 818, indicates to scheduler 824 that jobscript 1004(2) depends on the execution of job script 1004(1). The otherjob scripts 1004(3)-1004(N) can be associated with other request(s).

FIG. 11 is a block diagram 1100 illustrating execution of an applicationusing one or more computing resources, according to one or moreembodiments. FIG. 11 illustrates execution of a job script, such as jobscript 1004(1). This job script can indicate the application that is tobe executed, the computing resources to use, data 1102(1)-1102(N) isprovided to computing resources 1104(1)-1104(N), as well as toapplication 1106. It is noted that application 1106 can execute usingcomputing resources 1104(1)-1104(N), and as such, the execution paradigmmay vary depending on the type of computing resources 1104(1)-1104(N).After application 1106 executes, computing resources 1104(1)-1104(N) canthen return result(s) 1108 of the execution of application 1106.

For example, for FPGA computing resources 1104(1)-1104(N), each FPGA canbe re-configured (e.g., using a configuration script for the relevantrequest) in order to facilitate execution of application 1106. Forexample, this configuration script can configure FPGA computingresources 1104(1)-1104(N) with at least portions of application 1106(e.g., with certain functions of application 1106 that can be executedin parallel by multiple FPGAs). The remaining portion of application1106 can execute on the node(s) that contain these FPGAs. An exampleconfiguration script is described above with reference to FIG. 8E, andexample operation of using such a configuration script is describedbelow with reference to FIG. 12, among others.

For example, for GPU computing resources 1104(1)-1104(N), each GPU canalso be re-configured (e.g., using a configuration script for therelevant request) in order to facilitate execution of application 1106.For example, this configuration script can configure GPU computingresources 1104(1)-1104(N) with various parameters for application 1106.GPU computing resources 1104(1)-1104(N) can then execute application1106.

FIG. 12 is a flowchart illustrating a method 1200 for operation of ascheduler, according to one or more embodiments. As will be appreciatedin light of the present disclosure, this method may be modified in orderto derive alternative embodiments. Also, the operations in thisembodiment are shown in sequential order. However, certain operationsmay occur in a different order than shown, certain operations may beperformed concurrently, certain operations may be combined with otheroperations, and certain operations may be absent in another embodiment.Method 1200 is described with reference to variations of the elementsdescribed in connection with FIGS. 1 and 8. In one embodiment, method1200 is implemented by scheduler 824. In one embodiment, method 1200 canbe performed as a part of, or prior to, elements 308 of method 300 and408 of method 400.

In element 1202, a next job script is determined, according to oneembodiment. For example, the scheduler can determine that the next jobscript is job script 1004(1) from queue 1002. In another embodiment, thescheduler can determine a different next job script, such as job script1004(3). The scheduler can select the next job script based on variousparameters, as described below with reference to FIG. 13.

In element 1204, a determination is made as to whether the computingresource(s) should be configured. For example, the scheduler can makethis determination based on the control script associated with theselected job script, on the requirements specified in the job script,and/or on a current state of the determined computing resource(s). Inother words, the scheduler can configure certain computing resource(s)as indicated by the control script. The resource manager can alsoindicate to the scheduler which computing resources are busy executingother applications. The scheduler would then determine to use/configurecomputing resources that are not busy executing other applications.

In element 1206, the computing resource(s) are configured. For example,the determined computing resource(s) are configured by configurator 830if element 1204 determines that the computing resource(s) should beconfigured. In one embodiment, configurator 830 can execute aconfiguration script (i.e., that is associated with the selected jobscript/part of a current workflow). In another embodiment, configurator830 can execute configuration file(s) referenced by the configurationscript(s) (e.g., that are associated with the selected job script). Thetype of these configuration files(s) may vary depending on the type ofcomputing resource(s) being configured. An example configuration scriptis described above with reference to FIG. 8E, among others.

In element 1208, execution of an application is initiated according tothe next job script. For example, the scheduler can initiate executionof the application according to the selected job script (and,optionally, the associated control script). The execution of theapplication is described more fully above with reference to FIG. 11.

In element 1210, a determination is made as to whether the job queue isempty. The scheduler can make this determination. If the job queue isnot empty, then element 1202 is performed next. If the job queue isempty, then method 1200 can end (or pause until the next job script inplaced in the queue).

FIG. 13 is a flowchart illustrating a method 1300 for determining a nextjob script, according to one or more embodiments. As will be appreciatedin light of the present disclosure, this method may be modified in orderto derive alternative embodiments. Also, the operations in thisembodiment are shown in sequential order. However, certain operationsmay occur in a different order than shown, certain operations may beperformed concurrently, certain operations may be combined with otheroperations, and certain operations may be absent in another embodiment.Method 1300 is described with reference to variations of the elementsdescribed in connection with FIGS. 1 and 8. In one embodiment, method1300 is implemented by scheduler 824. In one embodiment, method 1300 canbe performed as a part of element 1202 of method 1200.

In element 1302, available computing resources are determined, accordingto one embodiment. For example, the scheduler can examine whether theworkflow (e.g., job script(s) and/or associated control script)indicates what computing resource(s) to use for execution of theapplication associated with the next job script. For example, thedetermined computing resource(s) (e.g., determined in element 302) canbe used, at the time of this determination, to execute anotherapplication, may be in the process of being re-configured, and/or maybeunavailable for another reason. If the determination indicates that thedetermined computing resource(s) are unavailable, method 1100 executeselement 1102 next. During the next execution of element 1102, the method1100 may keep track of which computing resources are unavailable.

In element 1304, dependencies of executing application(s) aredetermined, according to one embodiment. Specifically, the scheduler candetermine the dependencies of applications that are currently executing.Each application can correspond to a certain job script. However, someworkflows can include multiple job scripts, and thus each such workflowcan be associated with multiple different applications. For example, asdescribed in some of the examples above, a single workflow can include ajob script for a BWA align operation, and another job script for a SAMfunction. In this case, even if a job script for the SAM function wasqueued right after the job script for the BWA align operation, thescheduler would determine the dependency of the executing job (e.g., theBWA application corresponding to that certain script), and may select adifferent job script.

In element 1306, other job script factor(s) are determined, according toone embodiment. These job script factors may include projected nodeloads (i.e., for nodes that include the computing resources), priorityof each job script in the queue (i.e., where each job script can have anassociated priority, and a higher priority indicates that the associatedjob script is to be executed prior to the lower priority job script(s)),projected time of execution of each application that is alreadyexecuting, projected time of execution of each application to beexecuted (i.e., application(s) associated with the job scripts in thequeue), among others. In one embodiment, the other factors can includethe amount of times resource(s) would need to be reconfigured. Forexample, if job scripts A, B, and C are queued in this order, allreference a first resource, all three job scripts indicate areconfiguration of this resource, and the reconfiguration for jobscripts A and C are the same, then these other factors may be used. As aresult, job scripts A and C are executed and then job B, which wouldonly require one reconfiguration before job scripts A and C and thenanother reconfiguration before job script B, instead of threereconfigurations (i.e., one before each job script).

In element 1308, the next job script is determined based on theavailable computing resources, dependencies of these available computingresources, and/or other job script factors, among others. In element1208, the scheduler initiates an application associated with this nextjob script. In the example above, since the SAM application depends onthe execution of the BWA application, another job script can bedetermined that corresponds to another application being executed.

FIG. 14 is a more detailed block diagram 1400 of a storage 1402,according to one embodiment, such as storage 108 described in FIG. 1. Itis noted that in some embodiments, one or more of elements of storage1402 may not be used. In one embodiment, storage 1402 includes one ormore model(s) 1404(1)-1404(N) and application data 1406(1)-1406(N).Model(s) 1404(1)-1404(N) can be associated with one or moreapplications. The model is selected, such as by a model selector, duringworkflow generation. Each model can include templates on how to generatevarious scripts that indicate how to execute each application, one ormore computing resource(s) where each such application can be executed,and/or how to configure any such computing resource(s), among others. Inone embodiment, application data 1406(1)-1406(N) can include at leastportions of each application. For example, application data1406(1)-1406(N) can include certain functions and/or instructions thatcan be used during workflow generation, e.g., that can be included inthe configuration script(s). In this implementation, the configurationscript can configure the computing resource with suchfunction/instructions that can be executed during execution of suchapplication.

FIG. 15 is a more detailed block diagram 1500 of third-party storage1502, according to one embodiment, such as third-party storage 110described in FIG. 1. It is noted that in some embodiments, one or moreof elements of third-party storage 1502 may not be used. In oneembodiment, third-party storage 1502 includes client data1504(1)-1504(N). Client data 1504(1)-1504(N) can be associated withclients 102(1)-102(N). Client data 1504(1)-1504(N) can include variousdata that is to be processed during application execution, as indicatedby the parameters. For example, client 102(1) can store client data1504(1) using third-party storage 1502. A request from client 102(1) canindicate to perform a BWA align operation on client data 1504(1) andthen a SAM function operation on the result from this BWA alignoperation.

FIG. 16 is a block diagram 1600 of a server node 1602, such as server104, as described in FIG. 1, among others. Server node 1602 includes oneor more processor(s) 1604, a communication module 1606, and memory 1608.It is noted that is some embodiments, one or more of these elements maybe combined. Memory 1608 includes server module 1610 and file system1612. Server module 1610 can be an implementation of server module 114.It is also noted that server module 1610 may be implemented as asoftware and/or hardware module. It is also noted that in someembodiments one or more of elements of server node 1602 may not be used.Communication module 1606 can facilitate communication of server node1602 with the client(s), the node(s), and/or various storage system(s).Processor(s) 1604 can execute one or more of server module 1610 and/orfile system 1612. Server module 1610 can implement at least portions ofmethods 200, 300, 500, 900, 950, 1200, and/or 1300.

FIG. 17 is a block diagram of a resource node 1702, such as nodes112(1)-112(N) and/or nodes 702, 722, 742. Resource node 1100 includesone or more processor(s) 1704, a communication module 1706, memory 1708,and one or more resource(s) 1710(1)-1710(N). It is noted that is someembodiments, one or more of these elements may be combined. Memory 1708includes file system 1712, one or more agents 1714, one or more resourceattributes 1716, and one or more applications 1718. It is also notedthat various modules of resource node 1702 may be implemented as asoftware and/or hardware module(s). It is also noted that in someembodiments one or more of elements of resource node 1702 may not beused. Communication module 1706 can facilitate communication of resourcenode 1702 with the server, other node(s), and/or various storagesystem(s). Processor(s) 1704 can execute one or more of file system1612, agent(s) 1714, and/or one or more applications 1718. Resource(s)1710(1)-1710(N) can be implementations of resources 704(1)-704(N),724(1)-724(N), 744(1)-744(N), and/or 1104(1)-1104(N). Application(s)1718 can be implementation(s) of application 1106. Agent(s) can includeone or more elements of (and/or portions of) the scheduler, resourcemanager, and/or configurator.

Elements of network architecture can be implemented using differentcomputer systems and networks. An example of one such networkenvironment is described below with reference to FIG. 18.

FIG. 18 is a simplified block diagram illustrating a networkarchitecture 1800 in which one or more clients are provided with accessto a server via various network connections. As depicted in FIG. 18,clients 1802(1)-(N) are coupled to a network 1810 (which can be used toimplement network 106A and/or 106B), and so are able to access a server1806 (which can be used to implement server 104 and/or node(s)112(1)-112(N)) via network 1810. Other servers (not shown) can be usedinstead to implement server 104). A client can be implemented using, forexample, a desktop computer, a laptop computer, a workstation, a server,a cell phone, a smart phone, a network-enabled personal digitalassistant (PDA), or the like. An example of network 1810, which can beused by clients 1802(1)-(N) to access server 1806, is the Internet.Alternatively, access to server 1806 can be provided by a local areanetwork (LAN) utilizing Ethernet, IEEE 802.11x, or some othercommunications protocol. As will be appreciated, server 1806 can beaccessed by clients coupled directly thereto (not shown).

As also depicted on FIG. 18, server 1806 is coupled to a server storagedevice 1808, which includes a data volume such as cluster shared volume.Server storage device 1808 can be implemented as a single storage deviceor a collection of storage devices. Server storage device 1808 can alsobe implemented as a storage area network, which couples remote storagedevices to a server (e.g., server 1806), such that the remote storagedevices appear as locally-attached storage devices to the server's OS,for example.

In light of the present disclosure, those of skill in the art willappreciate that server storage device 1808 can be implemented by anytype of computer-readable storage medium, including, but not limited to,internal or external hard disk drives (HDD), optical drives (e.g., CD-R,CD-RW, DVD-R, DVD-RW, and the like), flash memory drives (e.g., USBmemory sticks and the like), tape drives and the like. Alternatively,those of skill in the art will also appreciate that, in light of thepresent disclosure, network architecture 1800 can include othercomponents such as routers, firewalls and the like that are not germaneto the discussion of the present network and will not be discussedfurther herein. Those of skill in the art will also appreciate thatother configurations are possible. For example, clients 1802(1)-(N) canbe directly coupled to server storage device 1808 without the user of aserver or Internet; server 1806 can be used to implement both theclients and the server; network architecture 1800 can be implementedwithout the use of clients 1802(1)-(N); and so on.

As an example implementation of network architecture 1800, server 1806(implemented with a server 104) services requests to data generated byclients 1802(1)-(N) to data stored in server storage device 1808(implemented with third-party storage 110). Other servers (not depicted)can be implemented with server 104. A server module (e.g., server module114) can be implemented using one of the other servers in the mannerillustrated by FIGS. 2, 3, 5, 8, 9B, 12, and/or 13.

FIG. 19 depicts a block diagram of a computer system 1910 suitable forimplementing the present disclosure. Computer system 1910 may beillustrative of various computer systems in distributed computing system100, such as server(s) 104 or nodes 112(1)-112(N), among others.Computer system 1910 includes a bus 1912 which interconnects majorsubsystems of computer system 1910, such as a central processor 1914, asystem memory 1917 (typically RAM, but which may also include ROM, flashRAM, or the like), an input/output controller 1918, an external audiodevice, such as a speaker system 1920 via an audio output interface1922, an external device, such as a display screen 1924 via displayadapter 1926, serial ports 1928 and 1930, a keyboard 1932 (interfacedwith a keyboard controller 1933), a storage interface 1934, a floppydisk drive 1937 operative to receive a floppy disk 1938, a host busadapter (HBA) interface card 1935A operative to connect with a FibreChannel network 1990, a host bus adapter (HBA) interface card 1935Boperative to connect to a SCSI bus 1939, and an optical disk drive 1940operative to receive an optical disk 1942. Also included are a mouse1946 (or other point-and-click device, coupled to bus 1912 via serialport 1928), a modem 1947 (coupled to bus 1912 via serial port 1930), anda network interface 1948 (coupled directly to bus 1912).

Bus 1912 allows data communication between central processor 1914 andsystem memory 1917, which may include read-only memory (ROM) or flashmemory (neither shown), and random access memory (RAM) (not shown), aspreviously noted. The RAM is generally the main memory into which theoperating system and application programs are loaded. The ROM or flashmemory can contain, among other code, the Basic Input-Output system(BIOS) which controls basic hardware operation such as the interactionwith peripheral components. Applications resident with computer system1910 are generally stored on and accessed via a computer readablemedium, such as a hard disk drive (e.g., fixed disk 1944), an opticaldrive (e.g., optical drive 1940), a floppy disk unit 1937, or otherstorage medium. Additionally, applications can be in the form ofelectronic signals modulated in accordance with the application and datacommunication technology when accessed via network modem 1947 orinterface 1948.

Storage interface 1934, as with the other storage interfaces of computersystem 1910, can connect to a standard computer readable medium forstorage and/or retrieval of information, such as a fixed disk drive1944. Fixed disk drive 1944 may be a part of computer system 1910 or maybe separate and accessed through other interface systems. Modem 1947 mayprovide a direct connection to a remote server via a telephone link orto the Internet via an internet service provider (ISP). Networkinterface 1948 may provide a direct connection to a remote server via adirect network link to the Internet via a POP (point of presence).Network interface 1948 may provide such connection using wirelesstechniques, including digital cellular telephone connection, CellularDigital Packet Data (CDPD) connection, digital satellite data connectionor the like.

Many other devices or subsystems (not shown) may be connected in asimilar manner (e.g., document scanners, digital cameras and so on).Conversely, all of the devices shown in FIG. 19 need not be present topractice the present disclosure. The devices and subsystems can beinterconnected in different ways from that shown in FIG. 19. Theoperation of a computer system such as that shown in FIG. 19 is readilyknown in the art and is not discussed in detail in this application.Code for server module 114, agent(s) used by node(s) 112(1)-112(N)and/or for providing use of a distributed computing system (such asdescribed above with reference to methods 200, 300, 400, 500, 900, 950,1200, and/or 1300, and/or the block diagrams 800 and/or 1100), etc., toimplement the present disclosure can be stored in computer-readablestorage media such as one or more of system memory 1917, fixed disk1944, optical disk 1942, or floppy disk 1938. Memory 1920 is also usedfor storing temporary variables or other intermediate information duringthe execution of instructions by the processor 1910. The operatingsystem provided on computer system 1910 may be MS-DOS®, MS-WINDOWS®,OS/2®, UNIX®, Linux®, or another known operating system.

Moreover, regarding the signals described herein, those skilled in theart will recognize that a signal can be directly transmitted from afirst block to a second block, or a signal can be modified (e.g.,amplified, attenuated, delayed, latched, buffered, inverted, filtered,or otherwise modified) between the blocks. Although the signals of theabove described embodiment are characterized as transmitted from oneblock to the next, other embodiments of the present disclosure mayinclude modified signals in place of such directly transmitted signalsas long as the informational and/or functional aspect of the signal istransmitted between blocks. To some extent, a signal input at a secondblock can be conceptualized as a second signal derived from a firstsignal output from a first block due to physical limitations of thecircuitry involved (e.g., there will inevitably be some attenuation anddelay). Therefore, as used herein, a second signal derived from a firstsignal includes the first signal or any modifications to the firstsignal, whether due to circuit limitations or due to passage throughother circuit elements which do not change the informational and/orfinal functional aspect of the first signal.

Although the present invention has been described in connection withseveral embodiments, the invention is not intended to be limited to thespecific forms set forth herein. On the contrary, it is intended tocover such alternatives, modifications, and equivalents as can bereasonably included within the scope of the invention as defined by theappended claims.

Although the present invention has been described in connection withseveral embodiments, the invention is not intended to be limited to thespecific forms set forth herein. On the contrary, it is intended tocover such alternatives, modifications, and equivalents as can bereasonably included within the scope of the invention as defined by theappended claims.

1. A method comprising: identifying a first application to be executed;configuring a hardware computational element in a first manner, whereina compute node comprises the hardware computational element, and theconfiguring the hardware computational element in the first mannerconfigures the hardware computational element to execute at least aportion of the first application according to a first workflow;executing the at least one portion of the first application using thehardware computational element configured in the first manner, accordingto at least a portion of the first workflow; identifying a secondapplication to be executed; configuring the hardware computationalelement in a second manner, wherein the configuring the hardwarecomputational element in the second manner configures the hardwarecomputational element to execute at least a portion of the secondapplication according to a second workflow; and executing the at leastone portion of the second application using the hardware computationalelement configured in the second manner, according to at least a portionof the second workflow.
 2. The method of claim 1, further comprising:receiving the at least one portion of the first workflow at the computenode, wherein the at least one portion of the workflow references afirst plurality of scripts; accessing a first configuration file,wherein the first plurality of scripts comprises the first configurationfile, and the hardware computational element is configured in the firstmanner based on the first configuration file; receiving the at least oneportion of the second workflow at the compute node, wherein the at leastone portion of the workflow references a second plurality of scripts;and accessing a second configuration file, wherein the second pluralityof scripts comprises the second configuration file, and the hardwarecomputational element is configured in the second manner based on thesecond configuration file.
 3. The method of claim 2, wherein thehardware computational element comprises a field-programmable gate array(FPGA), and the FPGA is selected to execute the at least one portion ofthe first application by determining that attributes of the FPGA satisfythe resource requirements for the at least one portion of the firstapplication, and determining that the FPGA can be configured in thefirst manner prior to executing the at least one portion of the firstapplication.
 4. The method of claim 3, wherein the configuring the FPGAin the second manner comprises changing a bit stream of the FPGA.
 5. Themethod of claim 2, wherein the hardware computational element comprisesa graphical processing units (GPU), and the GPU is selected to executethe at least one portion of the first application by determining thatattributes of the GPU satisfy the resource requirements for the at leastone portion of the first application, and determining that the GPU canbe configured in the first manner prior to executing the at least oneportion of the first application.
 6. The method of claim 5, wherein theconfiguring the GPU in the second manner comprises reconfiguring one ormore registers of the GPU and/or changing the operation mode of the GPU.7. The method of claim 2, further comprising: identifying a first jobscript, wherein the first plurality of scripts comprises the first jobscript, and the first job script identifies the at least one portion ofthe first application, the hardware computational element, and the firstconfiguration script; identifying a second job script, wherein thesecond plurality of scripts comprises the second job script, and thesecond job script identifies the at least one portion of the secondapplication, the hardware computational element, and the secondconfiguration script; and accessing at least one control script, whereinthe at least one control script indicates that the first job script isto be executed before the second job script, the at least one controlscript indicates that the hardware computational element is to beconfigured in the first manner prior to executing the at least oneportion of the first application, and the at least one control scriptindicates that the hardware computational element is to be configured inthe second manner prior to executing the at least one portion of thesecond application.
 8. The method of claim 1, wherein the hardwarecomputational element comprises at least one of a field-programmablegate array (FPGA), a graphical processing unit (GPU), a digital signalprocessor (DSP), or a central processing unit (CPU).
 9. The method ofclaim 1, wherein the first workflow and the second workflow aredifferent portions of a single workflow.
 10. A system comprising: acompute node, wherein the compute node comprises a hardwarecomputational element, a configurator, and a resource manager, theconfigurator is configured to identify a first application to beexecuted, configure the hardware computational element in a firstmanner, where the configuring the hardware computational element in thefirst manner configures the hardware computational element to execute atleast a portion of the first application according to a first workflow,identify a second application to be executed, and configure the hardwarecomputational element in a second manner, wherein the configuring thehardware computational element in the second manner configures thehardware computational element to execute at least a portion of thesecond application according to a second workflow, and the resourcemanager is configured to execute the at least one portion of the firstapplication using the hardware computational element configured in thefirst manner, according to at least a portion of the first workflow, andexecute the at least one portion of the second application using thehardware computational element configured in the second manner,according to at least a portion of the second workflow.
 11. The systemof claim 10, wherein the compute node is configured to: receive the atleast one portion of the first workflow, wherein the at least oneportion of the workflow references a first plurality of scripts, accessa first configuration file, wherein the first plurality of scriptscomprises the first configuration file, and the hardware computationalelement is configured in the first manner based on the firstconfiguration file, receive the at least one portion of the secondworkflow at the compute node, wherein the at least one portion of theworkflow references a second plurality of scripts, and access a secondconfiguration file, wherein the second plurality of scripts comprisesthe second configuration file, and the hardware computational element isconfigured in the second manner based on the second configuration file.12. The system of claim 11, wherein the hardware computational elementcomprises a field-programmable gate array (FPGA), and the FPGA isselected to execute the at least one portion of the first application bydetermining that attributes of the FPGA satisfy the resourcerequirements for the at least one portion of the first application, anddetermining that the FPGA can be configured in the first manner prior toexecuting the at least one portion of the first application.
 13. Thesystem of claim 12, wherein the configuring the FPGA in the secondmanner comprises changing a bit stream of the FPGA.
 14. The system ofclaim 11, wherein the hardware computational element comprises agraphical processing units (GPU), the GPU is selected to execute the atleast one portion of the first application by determining thatattributes of the GPU satisfy the resource requirements for the at leastone portion of the first application, and determining that the GPU canbe configured in the first manner prior to executing the at least oneportion of the first application, and the configuring the GPU in thesecond manner comprises reconfiguring one or more registers of the GPUand/or changing the operation mode of the GPU.
 15. The system of claim11, wherein the compute node is configured to: identify a first jobscript, wherein the first plurality of scripts comprises the first jobscript, and the first job script identifies the at least one portion ofthe first application, the hardware computational element, and the firstconfiguration script; identify a second job script, wherein the secondplurality of scripts comprises the second job script, and the second jobscript identifies the at least one portion of the second application,the hardware computational element, and the second configuration script;and access at least one control script, wherein the at least one controlscript indicates that the first job script is to be executed before thesecond job script, the at least one control script indicates that thehardware computational element is to be configured in the first mannerprior to executing the at least one portion of the first application,and the at least one control script indicates that the hardwarecomputational element is to be configured in the second manner prior toexecuting the at least one portion of the second application.
 16. Acomputer program product comprising: a plurality of instructions, thatwhen executed on a computer system, are configured to identify a firstapplication to be executed; configure a hardware computational elementin a first manner, wherein a compute node comprises the hardwarecomputational element, and the configuring the hardware computationalelement in the first manner configures the hardware computationalelement to execute at least a portion of the first application accordingto a first workflow; execute the at least one portion of the firstapplication using the hardware computational element configured in thefirst manner, according to at least a portion of the first workflow;identify a second application to be executed; configure the hardwarecomputational element in a second manner, wherein the configuring thehardware computational element in the second manner configures thehardware computational element to execute at least a portion of thesecond application according to a second workflow; and execute the atleast one portion of the second application using the hardwarecomputational element configured in the second manner, according to atleast a portion of the second workflow.
 17. The computer program productof claim 16, wherein the plurality of instructions are furtherconfigured to: receive the at least one portion of the first workflow atthe compute node, wherein the at least one portion of the workflowreferences a first plurality of scripts; access a first configurationfile, wherein the first plurality of scripts comprises the firstconfiguration file, and the hardware computational element is configuredin the first manner based on the first configuration file; receive theat least one portion of the second workflow at the compute node, whereinthe at least one portion of the workflow references a second pluralityof scripts; and access a second configuration file, wherein the secondplurality of scripts comprises the second configuration file, and thehardware computational element is configured in the second manner basedon the second configuration file.
 18. The computer program product ofclaim 17, wherein the hardware computational element comprises afield-programmable gate array (FPGA), the FPGA is selected to executethe at least one portion of the first application by determining thatattributes of the FPGA satisfy the resource requirements for the atleast one portion of the first application, and determining that theFPGA can be configured in the first manner prior to executing the atleast one portion of the first application, and the configuring the FPGAin the second manner comprises changing a bit stream of the FPGA. 19.The computer program product of claim 17, wherein the hardwarecomputational element comprises a graphical processing units (GPU), theGPU is selected to execute the at least one portion of the firstapplication by determining that attributes of the GPU satisfy theresource requirements for the at least one portion of the firstapplication, and determining that the GPU can be configured in the firstmanner prior to executing the at least one portion of the firstapplication, and the configuring the GPU in the second manner comprisesreconfiguring one or more registers of the GPU and/or changing theoperation mode of the GPU.
 20. The computer program product of claim 17,wherein the plurality of instructions are further configured to:identify a first job script, wherein the first plurality of scriptscomprises the first job script, and the first job script identifies theat least one portion of the first application, the hardwarecomputational element, and the first configuration script; identify asecond job script, wherein the second plurality of scripts comprises thesecond job script, and the second job script identifies the at least oneportion of the second application, the hardware computational element,and the second configuration script; and access at least one controlscript, wherein the at least one control script indicates that the firstjob script is to be executed before the second job script, the at leastone control script indicates that the hardware computational element isto be configured in the first manner prior to executing the at least oneportion of the first application, and the at least one control scriptindicates that the hardware computational element is to be configured inthe second manner prior to executing the at least one portion of thesecond application.